Goto

Collaborating Authors

 news event


Lifelong Evolution: Collaborative Learning between Large and Small Language Models for Continuous Emergent Fake News Detection

Zhou, Ziyi, Zhang, Xiaoming, Zhang, Litian, Zhang, Yibo, Guan, Zhenyu, Li, Chaozhuo, Yu, Philip S.

arXiv.org Artificial Intelligence

--The widespread dissemination of fake news on social media has significantly impacted society, resulting in serious consequences. Conventional deep learning methodologies employing small language models (SLMs) suffer from extensive supervised training requirements and difficulties adapting to evolving news environments due to data scarcity and distribution shifts. EFND) framework to address these challenges. We further introduce a lifelong knowledge editing module based on a Mixture-of-Experts architecture to incrementally update LLMs and a replay-based continue learning method to ensure SLMs retain prior knowledge without retraining entirely. EFND significantly outperforms existed methods, effectively improving detection accuracy and adaptability in continuous emergent fake news scenarios. HE rampant spread of fake news on the Internet has already caused significant societal impact [1]. For instance, the spread of fake news during the Covid-19 pandemic has led to harmful consequences such as drug misuse and incorrect treatment methods [2]. As illustrated in Figure 2(a), fake news on emergent events evolves continuously, presenting a challenge for real-time detection systems to keep pace with its evolution. Furthermore, an alarming pattern known as "rumor resurgence" frequently occurs in social media, wherein past misinformation reappears, perpetuating its societal impact [3]. Chaozhuo Li is with School of Cyber Science and Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China (e-mail: lichaozhuo@bupt.edu.cn).


Talking Point based Ideological Discourse Analysis in News Events

Nakshatri, Nishanth, Mehta, Nikhil, Liu, Siyi, Chen, Sihao, Hopkins, Daniel J., Roth, Dan, Goldwasser, Dan

arXiv.org Artificial Intelligence

Analyzing ideological discourse even in the age of LLMs remains a challenge, as these models often struggle to capture the key elements that shape real-world narratives. Specifically, LLMs fail to focus on characteristic elements driving dominant discourses and lack the ability to integrate contextual information required for understanding abstract ideological views. To address these limitations, we propose a framework motivated by the theory of ideological discourse analysis to analyze news articles related to real-world events. Our framework represents the news articles using a relational structure - talking points, which captures the interaction between entities, their roles, and media frames along with a topic of discussion. It then constructs a vocabulary of repeating themes - prominent talking points, that are used to generate ideology-specific viewpoints (or partisan perspectives). We evaluate our framework's ability to generate these perspectives through automated tasks - ideology and partisan classification tasks, supplemented by human validation. Additionally, we demonstrate straightforward applicability of our framework in creating event snapshots, a visual way of interpreting event discourse. We release resulting dataset and model to the community to support further research.


Breaking Down Financial News Impact: A Novel AI Approach with Geometric Hypergraphs

Harit, Anoushka, Sun, Zhongtian, Yu, Jongmin, Moubayed, Noura Al

arXiv.org Artificial Intelligence

In the fast-paced and volatile financial markets, accurately predicting stock movements based on financial news is critical for investors and analysts. Traditional models often struggle to capture the intricate and dynamic relationships between news events and market reactions, limiting their ability to provide actionable insights. This paper introduces a novel approach leveraging Explainable Artificial Intelligence (XAI) through the development of a Geometric Hypergraph Attention Network (GHAN) to analyze the impact of financial news on market behaviours. Geometric hypergraphs extend traditional graph structures by allowing edges to connect multiple nodes, effectively modelling high-order relationships and interactions among financial entities and news events. This unique capability enables the capture of complex dependencies, such as the simultaneous impact of a single news event on multiple stocks or sectors, which traditional models frequently overlook. By incorporating attention mechanisms within hypergraphs, GHAN enhances the model's ability to focus on the most relevant information, ensuring more accurate predictions and better interpretability. Additionally, we employ BERT-based embeddings to capture the semantic richness of financial news texts, providing a nuanced understanding of the content. Using a comprehensive financial news dataset, our GHAN model addresses key challenges in financial news impact analysis, including the complexity of high-order interactions, the necessity for model interpretability, and the dynamic nature of financial markets. Integrating attention mechanisms and SHAP values within GHAN ensures transparency, highlighting the most influential factors driving market predictions. Empirical validation demonstrates the superior effectiveness of our approach over traditional sentiment analysis and time-series models.


Assessing News Thumbnail Representativeness: Counterfactual text can enhance the cross-modal matching ability

Yoon, Yejun, Yoon, Seunghyun, Park, Kunwoo

arXiv.org Artificial Intelligence

This paper addresses the critical challenge of assessing the representativeness of news thumbnail images, which often serve as the first visual engagement for readers when an article is disseminated on social media. We focus on whether a news image represents the actors discussed in the news text. To serve the challenge, we introduce NewsTT, a manually annotated dataset of 1000 news thumbnail images and text pairs. We found that the pretrained vision and language models, such as BLIP-2, struggle with this task. Since news subjects frequently involve named entities or proper nouns, the pretrained models could have a limited capability to match news actors' visual and textual appearances. We hypothesize that learning to contrast news text with its counterfactual, of which named entities are replaced, can enhance the cross-modal matching ability of vision and language models. We propose CFT-CLIP, a contrastive learning framework that updates vision and language bi-encoders according to the hypothesis. We found that our simple method can boost the performance for assessing news thumbnail representativeness, supporting our assumption. Code and data can be accessed at https://github.com/ssu-humane/news-images-acl24.


Why You Fell for the Fake Pope Coat

The Atlantic - Technology

Being alive and on the internet in 2023 suddenly means seeing hyperrealistic images of famous people doing weird, funny, shocking, and possibly disturbing things that never actually happened. In just the past week, the AI art tool Midjourney rendered two separate convincing, photographlike images of celebrities that both went viral. Last week, it imagined Donald Trump's arrest and eventual escape from jail. Over the weekend, Pope Francis got his turn in Midjourney's maw when an AI-generated image of the pontiff wearing a stylish white puffy jacket blew up on Reddit and Twitter. But the fake Trump arrest and the pope's Balenciaga renderings have one meaningful difference: While most people were quick to disbelieve the images of Trump, the pope's puffer duped even the most discerning internet dwellers. This distinction clarifies how synthetic media--already treated as a fake-news bogeyman by some--will and won't shape our perceptions of reality.


Classification of Cross-cultural News Events

Sittar, Abdul, Mladenic, Dunja

arXiv.org Artificial Intelligence

We present a methodology to support the analysis of culture from text such as news events and demonstrate its usefulness on categorizing news events from different categories (society, business, health, recreation, science, shopping, sports, arts, computers, games and home) across different geographical locations (different places in 117 countries). We group countries based on the culture that they follow and then filter the news events based on their content category. The news events are automatically labelled with the help of Hofstedes cultural dimensions. We present combinations of events across different categories and check the performances of different classification methods. We also presents experimental comparison of different number of features in order to find a suitable set to represent the culture.


Creating "Unbiased News" Using Data Science

#artificialintelligence

I scrapped all their webpages categorized under "stories". AllSides is a brilliant initiative that takes a news event and collects articles written on it by a left leaning, right leaning and center leaning media outlet. They write a summary on this event and briefly mention what is being emphasized on by each of the three outlets. An example of this can be viewed here. They publish pre-established metrics for the contemporary political bias of all major media outlets.


News-Driven Stock Prediction With Attention-Based Noisy Recurrent State Transition

Liu, Xiao, Huang, Heyan, Zhang, Yue, Yuan, Changsen

arXiv.org Artificial Intelligence

We consider direct modeling of underlying stock value movement sequences over time in the news-driven stock movement prediction. A recurrent state transition model is constructed, which better captures a gradual process of stock movement continuously by modeling the correlation between past and future price movements. By separating the effects of news and noise, a noisy random factor is also explicitly fitted based on the recurrent states. Results show that the proposed model outperforms strong baselines. Thanks to the use of attention over news events, our model is also more explainable. To our knowledge, we are the first to explicitly model both events and noise over a fundamental stock value state for news-driven stock movement prediction.


Mining News Events from Comparable News Corpora: A Multi-Attribute Proximity Network Modeling Approach

Kim, Hyungsul, El-Kishky, Ahmed, Ren, Xiang, Han, Jiawei

arXiv.org Machine Learning

We present ProxiModel, a novel event mining framework for extracting high-quality structured event knowledge from large, redundant, and noisy news data sources. The proposed model differentiates itself from other approaches by modeling both the event correlation within each individual document as well as across the corpus. To facilitate this, we introduce the concept of a proximity-network, a novel space-efficient data structure to facilitate scalable event mining. This proximity network captures the corpus-level co-occurence statistics for candidate event descriptors, event attributes, as well as their connections. We probabilistically model the proximity network as a generative process with sparsity-inducing regularization. This allows us to efficiently and effectively extract high-quality and interpretable news events. Experiments on three different news corpora demonstrate that the proposed method is effective and robust at generating high-quality event descriptors and attributes. We briefly detail many interesting applications from our proposed framework such as news summarization, event tracking and multi-dimensional analysis on news. Finally, we explore a case study on visualizing the events for a Japan Tsunami news corpus and demonstrate ProxiModel's ability to automatically summarize emerging news events.


Enhancing Stock Market Prediction with Extended Coupled Hidden Markov Model over Multi-Sourced Data

Zhang, Xi, Li, Yixuan, Wang, Senzhang, Fang, Binxing, Yu, Philip S.

arXiv.org Machine Learning

Noname manuscript No. (will be inserted by the editor) Abstract Traditional stock market prediction methods commonly only utilize the historical trading data, ignoring the fact that stock market fluctuations can be impacted by various other information sources such as stock related events. Although some recent works propose event-driven prediction approaches by considering the event data, how to leverage the joint impacts of multiple data sources still remains an open research problem. In this work, we study how to explore multiple data sources to improve the performance of the stock prediction. We introduce an Extended Coupled Hidden Markov Model incorporating the news events with the historical trading data. To address the data sparsity issue of news events for each single stock, we further study the fluctuation correlations between the stocks and incorporate the correlations into the model to facilitate the prediction task. Keywords Stock prediction · Event extraction · Information fusion · Hidden Markov Model 1 Introduction The capability of predicting the stock price movement directions can offer enormous arbitrage profit opportunities and thus attract much attention from both academia and industry. Conventional quantitative trading prediction methods are mostly based on the historical trading data such as prices and volumes. According to the Efficient Market Hypothesis (EMH) [16], stock prices are the reflection of all known information. Key Laboratory of Trustworthy Distributed Computing and Service (Beijing University of Posts and Telecommunications), Ministry of Education, Beijing, China. As more and more investors obtain information from social media [49, 57], the indicators obtained from Web news articles and social networks can also have significant impacts on the stock prices, and thus such factors that can derive the stock price fluctuations must be considered. As such, there are growing research interests in exploring financial text documents such as news articles, financial standings to facilitate the stock prediction task.